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Abstract 

We study the origin of scale invariance (SI) of the degree distribution in scale-free (SF) networks 
with a degree exponent 7 under coarse graining. A varying number of vertices belonging to a 
community or a box in a fractal analysis is grouped into a supernode, where the box mass M 
follows a power-law distribution, P m (M) ~ M~ ri . The renormalized degree k' of a supernode 
scales with its box mass M as k' ~ M e . The two exponents n and 9 can be nontrivial as n 7^ 7 and 
9 < 1. They act as relevant parameters in determining the self-similarity, i.e., the SI of the degree 
distribution, as follows: The self-similarity appears either when 7 < n or under the condition 
9 = (77 — l)/(7 — 1) when 7 > n, irrespective of whether the original SF network is fractal or 
non-fractal. Thus, fractality and self-similarity are disparate notions in SF networks. 
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FIG. 1: (Color online) (a) The protein interaction network of the budding yeast and (b) its coarse- 
graining. Group masses {M a }, detected by the box covering, are heterogeneous. The node size in 
(b) is taken as ~ \JM a for visualization. 

I. INTRODUCTION 

Kadanoff's block spin and coarse-graining (CG) picture is the cornerstone of the 
renormalization-group (RG) theory jj. A system is divided into blocks of equal size and is 
described in terms of the block variables that represent the average behavior of each block. 
Scale invariance at the critical point under this CG enables one to evaluate the critical ex- 
ponents. From a geometric point of view, scale invariance implies the presence of fractal 
structures, and their fractal dimensions are associated with the critical exponents 2j. For 
example, the magnetization and the singular part of the internal energy for the critical Ising 
model are such scale invariant quantities, which are manifested as the fractal geometric 
forms of the area of spin domains and the length of the spin domain interface, respectively. 

The fractal dimension ds of a fractal object is measured by using the box-covering 
method |3|, in which the number of boxes, Nb(£b), needed to tile the object with boxes 
of size £b follows a power law, 

N B {i B )~r B dB . (i) 

This relation is referred to as fractal scaling hereafter. A fractal object is self-similar in the 
sense that it contains smaller parts, each of which is similar to the entire object [sj]. 

While the notions of CG and self-similarity are well established when an object is em- 



bedded in Euclidean space, it is not clear how to extend and apply them usefully to objects 
not embedded in Euclidean space. In the former case, the number of vertices within each 
box, equivalent to the block size in RG terminology and referred to as the group mass for 
future discussion, is almost uniform, and the fractal objects are self-similar and vice versa. 
In the latter case, however, the group mass is extremely heterogeneous so that it follows a 
power-law distribution. This case can happen in scale-free (SF) networks; then, one needs 
to establish the way of CG and the notion of self-similarity in a new setting, which is a 
goal of this paper. A SF network 4j is a network whose distribution Pd{k) of degree k, the 
number of edges connected to a given vertex, follows a power-law form: Pd{k) ~ fc -7 with 
the degree exponent 7. The scale invariance of the degree distribution under CG is defined 
as the self- similarity in SF networks. 

Most SF networks in the real world contain functional groups or communities within 
them In general, the distribution of the group mass M, the number of vertices within 
each group follows a power law asymptotically 6|, |7| : 

P m {M) ~ M~\ (2) 

When such groups are formed within networks, it would be more natural to take each group 
as a unit to form a supernode in CG because the vertices within each group are rather 
homogeneous in their characteristics, such as functionality or working division, and are 
connected densely. In other CG procedures, groups may not necessarily represent functional 
modules in bio-networks and communities in social networks; they can be taken arbitrarily 

or example, being artificially composed or boxes introduced in 



in a theoretical perspective, 



the fractal analysis 
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ll| . Supernodes are connected if any of their merging vertices 
in different communities are connected. This CG method is different from the standard ones 
in view of extremely heterogeneous group masses as shown in Fig. [U We show below that 
the exponent 77 plays a central role in determining the self- similarity. 

So far, several endeavors to achieve a RG transformation for SF networks have been 
carried out. However, their methods remain in the framework of the standard RG method, 
ignoring the heterogeneity of the group-mass distribution. For example, Kim [l2 , [3] applied 
CG to a SF network generated on a Euclidean space [14J], including long-range edges. Taking 
advantage of the underlying Euclidean geometry, the number of vertices within each block is 
uniform and increases in a power law as the block lateral size increases, so that the real-space 
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RG method can be naturally applied. In Refs. [15|] and [la ], the decimation method was 
applied to a few deterministic SF networks, which were constructed recursively starting from 
each basic structure. Those deterministic models restore their shapes under CG achieved 
by decimating the vertices with the smallest degree at each stage. We discuss this case in 
detail later. 

In this paper, we perform the CG of SF networks in three different ways: (i) random 
grouping, (ii) box covering in fractal analysis, and (iii) identifying community structure 
with clustering algorithms. The groups are (i) artificially composed, (ii) taken as boxes 
introduced in the fractal analysis, and (iii) taken as communities embedded within the 
network, in respective cases. For all cases, the group-mass distribution follows the power 
law, Eq. (|2J). We identify the criteria for self-similarity in terms of the exponents t] in Eq. (J2J) 
and 9 which will be introduced below in Eq. ([6]). Such a self-similarity condition holds for 
non-fractal, as well as fractal real-world networks. 



II. CG BY RANDOM GROUPING 



The model enables us to obtain the renormalized degree exponent 7' analytically by using 
the generating function technique, showing that, indeed, 7' depends on 77. The model is 
constructed as follows: (i) We construct a SF network through the static model of Ref. \\\ . 
We choose the degree exponent 7 = 3 and the mean degree (k) = 4. (ii) N individual 
vertices are grouped randomly into N' groups with sizes {M a } (a = 1, . . . ,N'), following 
the distribution function in Eq. (j2J). The exponent rj is tuned. We consider three cases of rj: 
(a) T) = 2 (< 7), (b) rj = 3 (= 7), and (c) 77 = 4 (> 7). (iii) We perform the CG by replacing 
each group with a supernode and connecting them if any of their merging vertices in different 
groups are connected. A renormalized network is constructed. A schematic snapshot of a 
constructed network is shown in Fig. [2j Next, the degree distribution P^(k') ~ k'~ y ' of the 
renormalized network is measured. The result is as follows: 7' = 77 = 2 7^ 7 for rj = 2 (a), 
7' = 7 = 3 for rj = 3 (b), and 7' = 7 = 3 for rj = 4 (c), as shown in Fig. [31 

The numerical result is understood analytically as follows: Since the vertices are grouped 
randomly, every vertex has an equal probability to connect to vertices in other groups per 
edge, so the total probability is proportional to the degree of each vertex. This leads to the 
relation k' a ~ ^2j £a kj, where a is the index of the group. Then, the degree distribution 
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FIG. 2: (Color online) Schematic snapshot of a random SF network with random grouping. 
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FIG. 3: (Color online) Degree distributions of a renormalized network of the static model for 7 = 3 
under CG. The group masses are preassigned to follow a power law P m (M) ~ M _7) with r/ = 2(o), 
3 (□), and 4(A). The degree distribution of the original network is also drawn (•). The dashed 
and the dotted lines with slopes of 2.1 and 3.0, respectively, are drawn as guidelines. 



P' d (k') of the renormalized network is written as 



00 MM 
M=X ki,k2,—,kM 3=1 3=1 



(3) 



S denoting the Kronecker delta. By using the generating function technique, one can find 
the relation V d (z) = V m (Vd(z)), where Vd(z) is the generating function of Pdik) and so 
forth. The Vd(z) is obtained to be 



Vd{z) = 1 - (k)(l - z) + a(l - zf- 1 + O ((1 - z) 2 ) 



(4) 




FIG. 4: (Color online) Box-mass distribution for (a) the WWW and (c) the fractal model network 



of Ref. [20|. Data are for Is = 2 (•) and £b = 5 (■). The solid lines are guidelines with slopes of 
-2.2 and -1.8, respectively, in both (a) and (c). The degree distributions of the original network 
(a) and the renormalized networks with £g = 2 (•) and £b = 5 (■) for (b) the WWW and (d) the 



fractal mode 
from Ref. 11] 



network. The fractal model has a system size N ~ 3 x 10 5 . (c) and (d) are adopted 



where (k) = 'Yl lk kPd{k) and a is a constant. The generating function V m (uj) is also derived 
in a similar form to Eq. (j4]). Then, one can find immediately that 



1 



(5) 



7, for 7 < 7], 
77, for 7 > rj. 

as long as both 7 and rj > 2. Thus, self-similarity holds when 7 < rj, and we can confirm 
that the exponent 77 plays a key role in determining the exponent 7'. In the formulation, the 
relation k' ~ ;c fc,- was crucial. When the relation no longer holds, the derivation of the 
degree exponent 7' is more complicated. This can happen for fractal and some non-fractal 
networks, and also for clustered networks, which we discuss next. 



III. CG BY BOX COVERING 



Recently, it was discovered 8| that fractal scaling, Eq. ([T]) holds in some SF networks, 
such as the world-wide web (WWW), the metabolic network of Escherichia coli, and the 
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protein interaction network of Homo sapiens. Groups are formed by covering the networks 
by boxes that contain nodes whose mutual distances are less than a given box size. The 
group-mass distribution follows the power law of Eq. (j2J), even though the boxes' lateral 
sizes are fixed as Ib for all boxes. Thus, the fractal networks are good objects for our study. 

We first apply a box-covering algorithm to the WWW. Our box-covering algorithm is 
slightly modified from the original one introduced by Song et al. 8j, and the details of the 
algorithm is presented in Ref. 9J. Both algorithms [8j, |9| are identical in spirit and the box 
size £b we use is related linearly to the corresponding one is in Ref. 8[. Next, each box is 
collapsed into a supernode. Two supernodes are connected if any of their merging vertices 
in different boxes are connected. The degree distribution P' d {k') of the renormalized network 
is examined. 

The distribution of box masses is measured and found to follow a power law asymptot- 
ically, P m (M) ~ M~ v [Fig. |4] (a)]. The exponent 77 is found to depend on the box size is- 
For small Ib = 1 or 2, rj m 2.2 is measured, which is close to 7. On the other hand, as £b 
increases, we expect 77 to approach the exponent r = 7/(7 — 1), describing the power-law 



behavior of the cluster-size distribution of the branching tree 18 



We find that 77 « r & 1.8 



for £b = 5. This result can be understood as follows: The WWW contains a skeleton [19(, a 
spanning tree based on the betweenness centrality or load, which can represent the original 



network in the box-covering 



20j • For small I b , the lateral dimension of the box is not large 



enough to see the asymptotic fractal behavior of the spanning tree, so that the number of 
vertices in a given box scales similarly to the largest degree within that box. Thus, 77 = 7. 
As Ib increases, on the other hand, the asymptotic fractal behavior of the spanning tree be- 
comes dominant. Thus, the exponent 77 becomes the same as the exponent r asymptotically, 
which was observed in the case of £b = 5. The case of 77 7^ 7 was not considered in Ref. 8j. 
The origin of the self-similarity is nontrivial as we show below. 

The CG process involves two steps. The first is the vertex renormalization, i.e., merging of 
vertices within a box into a supernode, and the second is the edge renormalization, merging 
of multiple edges between a pair of neighboring boxes into a single edge. The second step 
can yield a nonlinear relationship between the renormalized degree k' and box mass Mb, 
although the total number of inter-community edges from a given box is linearly proportional 
to its box mass (9)]. Thus, we propose that there exists a power-law relation between the 
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FIG. 5: (Color online) Plot of the average renormalized degree {k') versus box mass Mb for (a) 
the WWW and (b) the fractal model network of Ref. [20]. Data are for box sizes is = 2 (•), 
£b = 3 (■), and £b = 5 (a). The solid lines, guidelines, have s 
(a), respectively, for both (a) and (b). (b) is adopted from Ref. 11] 



es of 1.0 (•), 0.8 (■), and 0.6 



average renormalized degree and the box mass: 



(k')(M_ 



Bj 



M 



(6) 



The power-law relation Eq. ([6]) is tested numerically for the WWW, as shown in Fig. [5](a). 
For £b = 2, we estimate 9 ~ 1.0 ± 0.1. Thus, the linear relation holds. For £b = 3 and 
5, however, 9 « 0.8 ± 0.1 and 0.5 ± 0.1, respectively, implying the nonlinear relationship, 
Eq. ([6]). One may doubt the nonlinear behavior due to the scattered data shown in Fig[5](a). 



To confirm this result, we recall a previous study [ll| for the fractal model with 7 = 2.3 
introduced by Goh et al. 20J, where we can reduce the data noise by taking an ensemble 
average over network configurations. We obtained similar results as shown in Fig. [5](b). 
The data for the fractal model are averaged over 10 different network configurations, so the 
nonlinear relationship could be seen more clearly. 

Using P' d {k')dk' ~ P m (M B )dM B and k' ~ M B , we obtain the degree exponent of the 
renormalized network to be 7' = 1 + (77 — l)/9. In short, we argue that Eq. ([5]) should be 
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FIG. 6: (Color online) (a) Box-mass distribution and (b) average renormalized degree (k') versus 
box mass Mb of the Internet at the AS level. Here, the lateral box size is taken as Ib = 2 for both. 
The solid lines, drawn for reference, have slopes of (a) —1.8 and (b) 0.7. 
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generalized to 

7, for 7 < r], 

1 + (r) — l)/9, for 7 > r], 
where 9^1. Accordingly, the self-similarity holds even for 7 > rj when 

= fo-l)/( 7 -l). 



(7) 



(8) 



For £b = 2, we found that 77 ~ 7 and therefore, 7' » 7. For £5 = 5, even though 

6> 7^ 1, plugging 9 m 0.5 ~ 0.6 and 77 ~ 1.8 into Eq. ([7j), we obtain 7' w 2.3 ~ 2.6, which is 
in reasonable agreement with 7 « 2.3. Thus, self- similarity also holds [Fig. |4^b)]. We recall 
the previous study for the fractal model with 7 = 2.3 111], finding that the numerical results 
are the same as those of the WWW, as shown in Figs. H^c) and (d). 

The self-similarity condition in Eq. (jSJ) is also fulfilled for some non-fractal networks. 
The Internet at the autonomous system (AS) level is a prototypical non-fractal SF network. 
However, it exhibits self-similarity. We obtain 77 w 1.8 and the nonlinear relationship in 
Eq. with 9 w 0.7, so that 7^7'^ 2.1(1) for t B = 2, as shown in Fig. [6], satisfying the 




FIG. 7: (Color online) Renormalized degree distributions of real- world, (a)-(c) and (e)-(h), and 
model, (d) and (i), networks under successive CG transformations with a fixed box size £b = 2. 
Symbols represent the original network (o), and the renormalized networks after the first (A) and 
the second (v) iterations. The networks of (a)-(c) are fractals, and those of (d)-(i) are non-fractals. 
The networks of (a)-(f) are self-similar while those of (g)-(i) are not. 



condition in Eq. (jSJ). 

A few deterministic SF networks have been introduced, which were constructed recur- 



sively starting from each basic structure. The pseudofractal model 15 1 introduced by Doro- 
govtsev et al., the geometric fractal model introduced by Jung et al. 16|, and the hierarchical 
model are such examples. These models restore their shapes under the CG achieved by 
decimating the vertices with the smallest degree at each stage. Thus, they are self-similar 
in shape. In those methods, the degree distribution is scale invariant under the CG. How- 
ever, they are not fractal because fractal scaling, Eq.( [T[), i s absent and the fractal dimension 



cannot be defined. The three deterministic models 



1 1 61 ] satisfy the self-similarity con- 
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dition in a trivial manner: 77 = 7 and 6 = 1 under the decimation. Here, the group-mass 
distribution is measured as the distribution of the number of deleted vertices connected to 
each coarse-grained vertex at each decimation stage. Thus, 7' = 7, and the models are 
self-similar, even though they do not follow the fractal scaling, Eq. (jrj. 

We also examine the change of the degree distributions under successive CG transfor- 
mations for the fractal and some non-fractal networks in Fig. [7J We confirm that the 
self-similarity holds even for the non-fractal networks (d)-(f). It is noteworthy that the 
relation in Eq. ([6]) is linear for the first renormalization, but it becomes nonlinear for the 
second renormalization as 9 pa 0.75 and rj pa 1.8 for the WWW. It is also interesting to note 
that the static model with 7 = 2.3 is self-similar in Fig. UJd) while that with 7 = 3.6 in 
Fig. [7Ji) is not. That is due to the topological difference of the SF networks with 7 > 3 
and 2 < 7 < 3 0, [ljj] . When 7 is small, the edges are compactly concentrated around the 
hubs while as 7 grows the edges more globally interweave the network. Consequently, when 
renormalization is performed on the SF network with 2 < 7 < 3, only the nodes around 
each hub in the original network are grouped into a supernode in a coarse-grained network, 
and the supernode again becomes a hub with a corresponding size. On the other hand, 
when 7 > 3, the nodes far from the hubs in the original network have more chances to 
be connected to hubs via global edges, and the supernodes in the coarse-grained network 
become far bigger hubs than those in the original network. The result is more heavily-tailed 
degree distributions in the coarse-grained networks, as seen in Fig. [TJ(i). 



IV. CG BY COMMUNITY STRUCTURE 



Most SF networks in the real world contain functional modules or community structures 
within them, which are organized in a hierarchical manner. While the group-mass distri- 
butions are known to follow a power law in Eq. (j2J), for many cases, however, they exhibit 
a crossover between two distinct power-law behaviors or the power-law behavior appears 
only in a limited range of mass. Also, they are sensitive to various clustering algorithms [3] • 
Thus, it is not easy to find sufficiently good examples of clustered networks with appropriate 
clustering algorithms to test our argument. 



Here, we choose the algorithm proposed by Clauset et al. 2l| and apply it to the cond 



mat coauthorship network 22]. The network data contain 13,861 vertices and 44,619 edges. 
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Unfortunately, the degree distribution of this network is not a power law. Nonetheless, the 
data are clustered into 175 groups obtained at the point where the modularity becomes 
maximum in the clustering algorithm. The group-mass distribution is likely to follow the 
power law in Eq. (J2|), and the exponent is estimated to be rj m 1.6 ± 0.2 in Fig. E^a). Next, 
the CG is carried out; then, the degree distribution of the renormalized network is examined. 
It shows a power-law behavior with exponent 7' ~ 1.8 ±0.2. To check the formula of Eq. ([7j), 
we measure the relationship of Eq. ([6]) between the renormalized degree and the group mass 
in Fig. [H^b). The exponent 9 is measured to be 9 ~ 0.7 ± 0.1. We plug the numerical values 
of r) and 9 into the formula, and obtain 7' « 1.9 ± 0.4, which is in reasonable agreement 
with the measured value 7' ~ 1.8 ± 0.2 in Fig. [H(c). The degree distribution of the original 
network is overall skewed, while that of the renormalized network follows the power law. 
Thus, self-similarity does not hold for this case. 



V. DISCUSSION AND CONCLUSIONS 

For the fractal networks, Song et al. js| showed that in the box-covering method, the 
renormalized degree k! scales as k' ~ s(£B)k m , where s(£b) ~ £~B dk with dk = g?b/(7 — 1) 
and k m being the largest degree in a given box. This form is not directly applicable to 
non- fractal networks, but a modified form of k' max ~ (iV / /A^) 1 ^ 7 ~ 1 ' ) /i; max can be applied for 
non-fractal but self-similar SF networks, where k' max (/c m ax) is the largest degree in the entire 
renormalized (original) network and N'(N) is the total number of nodes after (before) CG. 
The above relation can be easily derived by using the scaling of the natural cutoff of degree, 
Aw ~ iV 1 /^" 1 ) and *4 ax ~ TV'VCt-i). Thus, for SF networks, it is more general to formulate 
a scaling function in terms of the ratio N'/N rather than the length scale £b- 

Although, in this paper, we limited the notion of self-similarity to the scale invariance of 
the degree distribution, one may wonder if other quantities, such as C(k) and (k nn )(k), are 
scale invariant under the CG. We find that such quantities tend to obey the scale invariance 
for the WWW in the box-covering method, but the statistics from real-world networks are 
not sufficiently good to support this conclusion. That means, the self- similarity thus defined 
does not imply any recursive topological identity nor does it even guarantee that the degree 
distribution within each group is identical from group to group. On the other hand, the 
origin of fractality is understood by the power-law relation between the length scale and the 
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FIG. 8: (Color online) When the coauthorship network is clustered, (a) the box-mass distribution, 
(b) the average renormalized degree {k') versus group mass M, and (c) the degree distribution 
of the original (■) and renormalized (•) networks are plotted. Groups are obtained by using the 



211 ]. The solid lines, drawn for reference, have slopes of (a) —1.6, (b) 



clustering algorithm of Ref. 
0.7, and (c) -1.8. 

size of the skeleton underlying the original network In such senses, it seems that the 
fractality describes an important topological feature of SF networks at a fundamental level 
while the self-similarity does not. Moreover, the results we obtained till now empirically 
show that all the fractal networks are self-similar, but the converse is not true. Thus, we 
conjecture that fractality implies self-similarity. 

In summary, we have studied the renormalization-group transformation of the degree 
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distribution, in particular, when the numbers of renormalized vertices with each block follows 
a power- law distribution, P m (M) ~ M~ v . We found that the average renormalized degree 
scales with the box mass as (/c')(M#) ~ M e B . The two exponents rj and 9 can be nontrivial 
as rj 7^ 7 and 9^1. They act as relevant parameters and are analogous to the scaling 
exponents associated with the magnetization and the singular part of the internal energy in 
the renormalization group theory. We obtained the degree exponent 7' of a remormalized 
network in terms of rj and 9. Many non-fractal networks are self-similar. The notions of 
fractality and self-similarity are disparate in SF networks, which is counterintuitive in view 
of their equivalence in Euclidean space. 
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